rank | frequency | n-gram |
---|---|---|
1 | 16022 | -s |
2 | 16000 | -e |
3 | 13553 | -n |
4 | 10646 | -a |
5 | 7532 | -o |
rank | frequency | n-gram |
---|---|---|
1 | 4022 | -on |
2 | 3913 | -en |
3 | 3869 | -es |
4 | 3437 | -le |
5 | 2291 | -ns |
rank | frequency | n-gram |
---|---|---|
1 | 2490 | -lle |
2 | 1206 | -ach |
3 | 1066 | -orf |
4 | 972 | -ons |
5 | 972 | -gen |
rank | frequency | n-gram |
---|---|---|
1 | 2107 | -ille |
2 | 975 | -dorf |
3 | 757 | -bach |
4 | 752 | -heim |
5 | 691 | -ngen |
rank | frequency | n-gram |
---|---|---|
1 | 1876 | -ville |
2 | 617 | -court |
3 | 609 | -ingen |
4 | 457 | -sdorf |
5 | 389 | - City |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings